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© A data processing system including a circuit for 
storing a sequence of instructions, a circuit for deter- 
mining if the the instruction sequence includes a 
branch instruction, a circuit for storing a sequence of 
branch target instructions in response to the deter- 
mination of the existence of a branch instruction in 
the stored sequence of instructions, a circuit for 
dispatching instructions in sequence after the branch 
instruction to a processor to be executed on con- 
dition that a branch is to be taken before a deter- 
mination of whether said branch will be taken and 
simultaneously for determining if the branch is to be 
taken, any circuit for directing the processor to ex- 
ecute the instructions in sequence after the branch if 
the branch is not taken, or, if the branch is to be 
taken, for dispatching the branch target instruction 
sequence to the processor for execution. 
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BRANCH INSTRUCTION EXECUTION APPARATUS 



This invention relates to apparatus and meth- 
ods for executing branch instructions. 

This patent application is related to U.S. Patent 
Application, Serial No. 07/126,818, entitled "Parallel 
Processor Instruction Dispatch Apparatus With In- 
terrupt Handler" and U.S. Patent Application. Serial 
No. 07/135,170, entitled "Dynamic Buffer Control". 

An obstacle to the fast execution of instructions 
in a data processing system is the execution of a 
branch instruction. Traditional processors execute 
instructions in a serial fashion, i.e., one instruction 
at a time. Therefore, instructions are executed se- 
quentially in a normal mode. However, when a 
branch instruction is encountered, the instruction 
sequence may be varied, i.e., the instruction follow- 
ing the branch instruction may not be the new 
instruction executed. In traditional data processing 
systems, the execution of instructions is delayed 
until a branch instruction execution can be com- 
pleted. This is because the result of the branch 
instruction execution must be known in order to 
determine which instruction is to be executed next. 
In data processing systems executing conditional 
branch instructions, a pre-existing condition must 
be determined before it will be known whether or 
not a branch may be taken as a result of a con- 
ditional branch instruction execution. As a result, 
the execution of a conditional branch instruction is 
delayed until the specified condition for the branch 
has been determined. In traditional processors the 
requirement for the resolution of conditions to per- 
form branches has not been as much a problem as 
in more modern processing systems. In a pipeline 
processing system where an instruction is being 
fetched, dispatched, decoded and executed at dif- 
ferent cycle times, the execution of branch instruc- 
tions will result in delays since the execution phase 
must be completed for a conditional branch before 
the data processing system knows which instruc- 
tions to fetch for execution after the branch. 

The use of multiple processors in a data pro- 
cessing system increases the conditional branch 
execution problem even more since it is advanta- 
geous to distribute instructions in parallel to the 
multiple processors for execution. However, if the 
multiple processor system is to emulate the execu- 
tion of instructions in a serial fashion, the execution 
of instructions after the occurrence of a branch 
instruction must be delayed until the branch in- 
struction execution is completed. 

An example of a prior art instruction dispatch- 
ing apparatus for a multiple processing system is 
disclosed in the IBM Technical Disclosure Bulletin. 
Vol. 29, No. 1, June, 1986, entitled "Branch-Pro- 
cessing Instruction Cache", pages 357-359. This 



disclosure illustrates a mechanism for fetching 
instructions and includes the capability to process 
branch instructions while other instructions are be- 
ing fetched. 

5 Accordingly the present invention provides a 
data processing system comprising means for stor- 
ing a sequence of instructions; means for determin- 
ing if said instruction sequence includes a branch 
instruction; means responsive to said determining 

w means for storing a sequence of branch target 
instructions in said storing means; means for ex- 
ecuting said branch instruction; means for dis- 
patching instructions in said sequence to a proces- 
sor for execution and dispatching instructions in 

75 said sequence after said branch instruction to said 
processor to be executed on condition that a 
branch is not taken before a determination of 
whether said branch will be taken while simulta- 
neously determining if said branch is to be taken; 

20 and means for directing, if said branch is not taken, 
said processor to execute said instructions in se- 
quence after said branch instruction or, if said 
branch is taken, for dispatching said branch target 
instruction sequence to said processor for execu- 

25 tion. 

The present invention also provides a data 
processing system comprising means for storing a 
plurality of instructions in sequence; means for 
determining whether said plurality of instructions 

30 includes a branch instruction; means responsive to 
said determining means for storing a plurality of 
branch target instructions in sequence, when said 
firstmentioned plurality of instructions includes a 
branch instruction, means for dispatching one or 

35 more instructions to a processor for execution, 
means for dispatching one or more subsequent 
instructions after the branch instruction in the se- 
quence, for execution on condition that the branch 
is not to be taken, means for executing said branch 

40 instruction, thereby determining whether the branch 
is to be taken, means responsive to said means for 
executing, for directing said processor to execute 
said subsequent instructions if said branch is not to 
be taken, and means responsive to said means for 

45 executing, of said branch target instructions for 
execution, if said branch is to be taken. 

The invention may be viewed from a number of 
aspects; these will be apparent by inspection of the 
appended claims and sub-claims. 

so In accordance with the present invention a data 
processing system is provided that includes a cir- 
cuit for storing a sequence of instructions, a circuit 
for determining if said instruction sequence in- 
cludes a branch instruction, a circuit responsive to 
the determination of whether or not there is a 
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branch instruction for storing a sequence of branch 
target instructions in the storing circuit, a circuit for 
dispatching instructions in sequence after the 
branch instruction to a processor to be executed on 
condition that a branch is not taken before a deter- 
mination of whether the branch will be taken and 
simultaneously for determining if the branch is to 
be taken, and a circuit for directing, if the branch is 
not to be taken, the processor to execute the 
instructions in sequence after the branch instruction 
or, if the branch is to be taken, for dispatching the 
branch target instruction sequence to the processor 
for execution. 

In a preferred embodiment the invention in- 
cludes a data processing system having a memory, 
an instruction fetching and branch processing unit, 
and a fixed point processing unit, in the instruction 
fetching and branch processing unit a mechanism 
is provided for fetching multiple instructions at a 
time and for storing these instructions in buffers. In 
this embodiment one buffer is provided for sequen- 
tial instructions and a second buffer is provided for 
target instructions. Sequential instructions include 
the normal sequence of instructions that are being 
fetched and also instructions that are present after 
a branch instruction is encountered. Target instruc- 
tions are instructions that are to be branched to if a 
branch is taken as a result of a branch instruction 
execution. In the preferred embodiment the instruc- 
tion fetching and branch processing unit executes 
branch instructions while coordinating the dispatch- 
ing of instructions to the fixed point processing 
unit According to the teachings of this invention, in 
certain circumstances, branch instructions can be 
generated in a pipeline fashion without causing any 
pipeline cycle delays of instruction execution. 

This invention relates to the field of data pro- 
cessing and a provides a capability to execute 
branch instructions in a small amount of time. The 
objective of the invention is to execute a branch in 
zero cycles of a fixed point processor. 

In order that the invention may be fully under- 
stood a preferred embodiment thereof will now be 
described, by way of example only, with reference 
to the accompanying drawings in which:- 

Figure 1 is a block diagram of a data pro- 
cessing system; 

Figure 2a shows an instruction sequence 
containing an unconditional branch; 

Figure 2b is a timing diagram illustrating the 
pipelined instruction flow for the instruction se- 
quence in Figure 2a; 

Figure 2c shows an instruction sequence 
including a not taken conditional compare adjacent 
to a compare instruction; 

Figure. 2d is a. timing diagram illustrating the 
pipelined data flow of the instruction sequence in 
Figure 2c; 



Figure 2e shows an instruction sequence 
including a taken conditional branch instruction ad- 
jacent to a compare instruction; 

Figure 2f is a timing diagram illustrating the 
5 pipelined instruction flow for the instruction se- 
quence of Figure 2e; 

Figure 2g shows an instruction sequence 
including a compare instruction with instructions 
between the compare and the taken conditional 
w branch instruction; 

Figure 2h is a timing diagram illustrating the 
pipelined instruction flow for the instruction se- 
quence in Figure 2g; 

Figure 2i shows an instruction sequence in- 
J5 eluding a compare instruction with three intermedi- 
ate instructions adjacent to a conditional branch 
instruction; 

Figure 2j is a timing diagram illustrating the 
pipelined instruction flow for the instruction se- 

20 quence in Figure 2i; 

Figure 3 is a block diagram illustrating the 
contents of the instruction fetching and branch pro- 
cessing unit; and 

Figure 4 is a flow diagram illustrating the 

25 operation of the interlock, dispatch, branch execu- 
tion, and buffer control logic unit. 

Figure 1 is a block diagram illustrating a pro- 
cessor system according to the invention. In Figure 
1, memory 10 is connected by bus 12 to an 

30 instruction fetching and branch processing unit 14, 
a fixed point processing unit 20 and other process- 
ing units represented by block 22. The function of 
bus 12 is to provide a transfer of instructions and 
data from memory 10. The instruction fetching and 

35 branch processing unit 14 provides instructions 
over line 16 to the fixed point processing unit 20. 
Also conditional code information is provided to the 
instruction fetching and branch processing unit 14 
over line 18 by the fixed point processing unit 20. 

40 In response to the condition code information on 
bus 18 provided by the fixed point unit 20 to the 
instruction fetching unit 14, branch execution logic 
in the instruction fetching unit 14 generates an 
execute conditional signal via line 41 if the branch 

45 is not to be taken or generates the cancel con- 
ditional signal via line 43 if the branch is to be 
taken. 

Figures 2a to 2j represent example instruction 
sequences and resulting timing diagrams for the 

so _ execution of those sequences. In Figure 2a an 
instruction sequence is shown that includes an 
unconditional branch instruction. Specifically the 
unconditional branch instruction (BRU) branches to 
instruction T1 (the target instruction). The timing 

55 diagram illustrating the operation of the preferred 
embodiment with this instruction sequence is illus- 
trated in Figure 2b. In Figure 2b four phases of the 
pipeline operation are illustrated. The first phase is 
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the . instruction fetch operation (IF). The second 
phase is the instruction dispatch operation (Disp). 
During the second phase the instruction fetching 
and processing unit 14 detects branch instructions 
and initiates branch instruction execution. The third 
phase is the fixed point processor decoding opera- 
tion (FXD). The fourth is the fixed point processor 
execution of the instruction (FXE). Seven instruc- 
tion cycles, are shown. In cycle 1 , four instructions 
are fetched (from the cache 34 Figure 3) which 
include XI. X2. BRU and S1. Note that the prefix S 
designates the sequential instructions that occur 
after the unconditional branch instruction (BRU). In 
cycle 2 the instruction fetch operation fetches addi- 
tional sequential instructions (S2 and S3). At the 
same time during cycle 2, the dispatch operation 
dispatches instructions X1 and X2 to the fixed point 
processing unit 20. The BRU instruction is dis- 
patched and executed internally in the branch pro- 
cessing unit 14. The target address is generated in 
cycle 2. In cycle 3, as a result of the execution of 
the unconditional branch in the branch processing 
unit 14, the four target instructions are fetched, T1, 
T2, T3, T4. Also in cycle 3, instruction X1 is 
decoded in the fixed point processing unit 20. In 
cycle 4 an additional four target instructions are 
fetched (T5, T6, T7 and T8). Also in cycle 4 
instructions T1 and T2 are dispatched to the fixed 
point processing unit 20. Meanwhile in cycle 4 the 
fixed point processing unit 20 decodes the instruc- 
tion X2 and executes instruction X1 . In cycle 5, the 
target instructions T3 and T4 are dispatched to the 
fixed point processing unit 20 while the fixed point 
processing unit 20 decodes instruction T1 and ex- 
ecutes instruction X2. In cycle 6, instructions T5 
and T6 are dispatched while the fixed processing 
unit 20 decodes target instruction T2 and executes 
instruction T1 , Finally in cycle 7, the instruction T3 
is being decoded while instruction T2 is being 
executed in the fixed point processing unit 20. Note 
that the branch operation takes no execution cycles 
and causes no execution delay since the fetching 
of the target instructions was accomplished as a 
result of the early execution of the branch instruc- 
tion during the dispatch cycle 2. 

In Figure 2c an instruction sequence is shown 
that provides for a fixed point compare instruction 
(C) followed by a branch on condition instruction 
(BRC). If the branch is not taken, the sequential 
instructions following the branch will be executed 
(S1 , S2, and S3). However, if the branch is taken 
the target instructions are executed (TT, T2, and 
T3). In the following example the conditional branch 
is not .taken. The timing diagram in Figure 2d 
illustrates the operation of the processing unit. Ini- 
tially in cycle 1 four instructions are fetched includ- 
ing the condition instruction (C), the conditional 
branch, instruction (BRC), and the two sequential 



instructions S1 and S2. In cycle 2 the next four 
sequential instructions (S3 through S6) are fetched 
while the compare instruction C and the sequential 
instruction S1 are passed to the fixed point pro- 

5 cessing unit 20. The Si' indicates that instruction 
S1 is passed as an instruction for conditional ex- 
ecution. In other words, the execution of S1 is 
conditional on the event that the branch is not 
taken. The conditional branch instruction (BRC) is 

to passed to the branch processing unit 1 4 where it is 
executed. However, the completion of the execu- 
tion of the conditional branch is delayed until the 
compare instruction is executed in the fixed point 
processing unit 20. In cycle 3, because of the initial 

rs execution of the branch instruction, four of the 
target instructions are fetched (T1 through T4). 
Simultaneously instructions S2 and S3 are dis- 
patched to the fixed point processing unit 20. Note 
that these instructions are dispatched as condi- 

20 tional instructions that will be executed upon the 
condition that the conditional branch will not be 
taken. In the fixed processing unit during cycle 3, 
the fixed point processor decodes the compare 
instruction. In cycle 4 the next four target instruc- 

25 tions are fetched (T5 through T8). Also the next two 
sequential instructions S4 and S5 are dispatched 
as conditional instructions to the fixed point pro- 
cessing unit 20. The fixed point processing unit 20 
decodes the sequential instruction S1 while it ex- 

30 ecutes the compare instruction. At the completion 
of the execution of the compare instruction the 
result of the compare instruction is provided to the 
branch processing unit 14 indicating to the branch 
processing unit 14 that the branch will not be taken 

35 (i.e., that the sequential instructions should be ex- 
ecuted). The execute condition is asserted via line 
41 . Next in cycle 5 the fixed point processing unit 
20 decodes the sequential instruction S2 while it 
executes the sequential instruction S1. Also in cy- 

40 cie 5 the fetching of target instructions is sus- 
pended and the dispatching of target instructions is 
suspended. In cycle 6 the fetching of sequential 
instructions is resumed with the fetching of sequen- 
tial instructions S7 through S10. Also in cycle 6, 

45 the fixed point processing unit 20 decodes instruc- 
tion S3 and executes sequential instruction S2. 
Finally in cycle 7, the fixed point processing unit is 
decoding instruction S4 and is executing instruction 
S3 illustrating that the sequential instruction se- 

50 quence is being executed. Note that again the 
execution of the branch takes no execution cycles 
and causes no delay even though the branch was 
not taken. 

Figure 2e illustrates a conditional branch in- 
55 struction where the branch is taken. The sequence 
illustrated in Figure 2e includes a fixed point com- 
pare instruction C, a conditional branch instruction 
BRC followed by a set of sequential instructions S1 
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through S3 then followed by a sequence of target 
instructions T1 through T4.. The execution of this 
sequence is illustrated in Figure 2f. 

In Figure 2f in cycle 1 the four instructions are 
fetched (C, BRC, S1, and S2). In cycle 2 t the next 
four instructions are fetched (S3 through S6) and 
the compare instruction and the first sequential 
instruction (S1) are dispatched to the fixed point 
processing unit 20 while the conditional branch 
instruction BRC is initially executed in the branch 
processing unit 14. In cycle 3 four of the target 
instructions are fetched (T1 through T4). The se- 
quential instructions S2 and S3 are conditionally 
dispatched to the fixed point processing unit 20. 
Meanwhile, the compare instruction is decoded in 
the fixed point processing unit 20. In cycle 4 the 
next four target instructions are fetched (T5 through 
T8), while the sequential instructions S4 and S5 are 
conditionally dispatched to the fixed point process- 
ing unit 20. The sequential instruction S1 is being 
decoded in the fixed processing unit at the same 
time the compare instruction is being executed. As 
a result of the completion of the execution of the 
compare instruction in cycle 4, the branch process- 
ing unit 14 is informed that the branch should be 
taken. Therefore, in cycle 5 the sequential instruc- 
tion S1 whose execution is conditioned on the 
branch not being taken will not be executed as a 
result the assertion of the cancel conditional line 
43. Also even though the sequential instruction S2 
is being decoded it will be purged. In cycle 6 the 
first two target instructions T1 and T2 are dis- 
patched to the fixed point processing unit 20. In 
cycle 7 instruction T1 is being decoded and ex- 
ecuted in cycle T8. Note that the execution of this 
conditional branch results in a three cycle delay. 
This is a worse case condition since the maximum 
delay in this configuration is 3-i cycles where i is 
the number of fixed point operations that are lo- 
cated between the compare and branch on com- 
pare instructions. 

In Figure 2g another instruction sequence is 
illustrated that first includes a compare instruction 
followed by two fixed point instructions X1 and X2. 
Following X2 is a branch on condition of the com- 
pare instruction. Following the branch is the set of 
sequential instructions S1 and S3 followed later by 
the target instructions T1 through T4. The timing 
execution of these instruction sequences are illus- 
trated in Figure 2h. Referring to Figure 2h, in cycle 
1, the first four instructions are fetched (C, X1, X2 
and BRC). In cycle 2 the next four instructions in 
the sequence are fetched (S1 through S4) while the 
compare instruction and the first fixed point instruc- 
tion X1 are dispatched to the fixed point unit 20. 
Also during cycle 2, the instruction sequence that 
has been fetched has been scanned for branch 
instructions. Upon encountering a branch instruc- 



tion, the scanning logic determines if the target 
instructions of the branch can be computed. In the 
example given the target instruction addresses are 
computed and then in cycle 3 the first four target 

5 instructions T1 through T4 are fetched. Also during 
cycle 3 instruction X2 is unconditionally dispatched 
and sequential instruction S1 is conditionally are 
dispatched to the fixed point processing unit 20. 
The conditional branch instruction BRC is dis- 

10 patched to the processing unit 14. The fixed point 
processing unit 20 decodes the compare instruc- 
tion C. In cycle 4, the next four target instructions 
are fetched (T5 through T8) while sequential 
instructions S2 and S3 are dispatched to fixed 

is point processing unit. The fixed point processing 
unit 20 decodes instruction X1 and executes the 
compare C. As a result of the compare instruction 
the branch is taken. In cycle 5 instruction X2 is 
decoded by the fixed point processing unit 20 

20 while the instruction X1 is executed. However, no 
additional instructions are fetched during these cy- 
cles and no instructions are dispatched, in cycle 6, 
the fetching of target instructions would continue 
however for the simplicity of this example only 

25 eight target instructions have been fetched. In cy- 
cle 6, the target instructions T1 and T2 are dis- 
patched to the fixed point processing unit 20 during 
the execution of instruction X2. In cycle 7 instruc- 
tions T3 and T4 are dispatched while instruction T1 

30 is being decoded. Finally in cycle 8, instruction T1 
is executed by the fixed point processing unit 20. 
Note that the branch in this case only causes a one 
cycle delay since there were two intermediate 
instructions between the compare and branch on 

as condition instructions. 

In Figure 2i another example instruction se- 
quence is illustrated that includes three intermedi- 
ate instructions between the fixed point compare 
instruction and the conditional branch instruction. 

40 The timing diagram for the execution of this in- 
struction sequence is illustrated in Figure 2j. In 
Figure 2j during cycle 1 the first four instructions of 
this instruction sequence in Figure 2i are fetched. 
In cycle 2 the second four instructions are fetched 

45 while the first two instructions are dispatched. Note 
that since the first four instructions that were 
fetched did not contain the branch instruction, the 
scanning logic would not detect that there is a 
branch until cycle 3. Therefore in cycle 3 if addi- 

50 tional sequential instructions were available they 
would be fetched. Also in cycle 3 instructions X2 
and X3 are dispatched to the fixed point process- 
ing unit 20 while the conditional branch is dis- 
patched to the branch processing unit 14 while the 

55 fixed point processing unit decodes the compare 
instruction. In cycle 4, the target instructions T1 
through T4 are now prefetched as a result of the 
scanning of the conditional branch instruction in 
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cycle 3. Sequential instructions S1 and S2 are 
conditionally dispatched to the fixed point process- 
ing unit 20 while instruction X1 is decoded and the 
compare instruction is executed. At the completion 
of the execution of the compare instruction in the 
fixed. point processing unit 20, the branch process- 
ing unit is signalled that the branch will be taken. In 
cycle 5 the next four target instructions are fetched 
(T5 through T8) while in the fixed point processing 
unit. 20 instruction X2 is decoded and instruction 
X1 is executed. In cycle 6 the first of the two target 
instructions T1 and T2 are dispatched to the fixed 
point processing unit 20 while instruction X3 is 
decoded and instruction X2 is executed. In cycle 7 
the next* two target instructions T3 and T4 are 
dispatched while instruction T1 is decoded and 
instruction X3 is executed. In cycle 8 finally instruc- 
tion T2 is being decoded while the first of the 
target instructions T1 is being executed. Note that 
the execution of this conditional branch results in a 
zero cycle delay since the number of intermediate 
instructions between the compare instruction and 
the branch is equal to three. 
. Figure 3 is a block diagram illustrating the 
instruction fetching and branch processing unit 14. 
In Figure 3 an instruction cache directory and array 
34 is connected to bus 12 to receive instructions 
from memory. The instructions from the instruction 
cache 34 are provided to either a target instruction 
buffer 36 or sequential instruction buffer 38. Target 
instruction buffer 36 has a capacity of four in the 
. preferred embodiment while the sequential instruc- 
tion buffer 38 has a capacity of eight instructions. 
The target instructions in the target instruction buff- 
er 36 pass to the sequential buffer 38 if the target 
instructions have been selected for dispatching. 
The. interlock, dispatch, branch execution, and buff- 
er control unit 40 is connected to the sequential 
instruction buffer 38 and scans the contents of the 
sequential instruction buffer 38 to detect branch 
instructions. Also, on line 50, the multiplex network 
52 is controlled by the control logic 40 to dispatch 
two instructions over bus 16 to the fixed point 
processing unit 20. 

Upon determining that a branch instructions 
exists in the sequential instruction buffer 38, the 
control logic 40 through the multiplexer network 52 
provides the branch instruction to the unresolved 
branch register 54,if the branch is not resolved, and 
to the branch address generation unit 62. The 
branch address generation unit determines if the 
address of the target instruction of the branch can 
be computed. In certain instances the count regis- 
ter 58 or link register 60 are used in a computation 
of. the address of the target instruction. The ad- 
dress of the target instruction is then provided over 
bus 61 to the instruction fetch address register 30. 
When the instruction fetch address register 30 has 



been loaded with an address of a target instruction 
the previous address of the sequential instructions 
being fetched is transferred to the instruction fetch 
address register shadow 32 to store the address of 

6 the sequential instructions if in fact the branch to 
the target instruction is not taken. The condition 
register 56 is connected by bus 18 to the fixed 
point processing unit 20 and determines the result 
of a condition compare instruction execution that 

w determines whether a conditional branch is taken. 

The control logic 40 monitors the condition 
register 56 and upon determining that a conditional 
branch is to be taken, the control logic 40 provides 
target instructions from the target instruction buffer 

/5 36 to the sequential instruction buffer 38 to be 
dispatched. The control of the target instruction 
buffer and the sequential instruction buffer is pro- 
vided by lines 37 and 39 respectively. If the branch 
is not taken, the address of the sequential instruc- 

20 tion is restored from register 32 to register 30 and 
sequential instructions in the sequential instruction 
buffer 38 continue to be dispatched. The control of 
the target instruction buffer 36 and the sequential 
instruction buffer 38 is provided by lines 37 and 39 

25 respectively from the control logic 40. Additionally 
connected to control logic 40. is line 43 which 
provides the cancel conditional signal to the fixed 
point processing unit 20 indicating that the con- 
ditional instructions (the sequential instruction oc- 

30 curring after the branch instruction) are not to be 
executed. Line 41 provides the execute conditional 
signal to the fixed point processing unit 20 which 
indicates to the fixed point processing unit that 
these conditional instructions are to be executed. 

35 Figure 4 is a flow chart representing the opera- 
tion of the interlock, dispatch, branch execution and 
buffer control logic 40. In Figure 4 at the start of a 
cycle 100 step 106 determines if an unresolved 
branch is pending. This is accomplished by deter- 

40 mining if a unresolved branch pending bit has been 
set. If no unresolved branch is pending the flow 
proceeds to step 110 at which time the sequential 
instruction buffer 38 is scanned to determine if 
there are any branch instructions present. If not, 

45 the control flow proceeds to step 104 to fetch a 
new set of sequential instructions into the sequen- 
tial instruction buffer 38 and to dispatch sequential 
instructions from sequential instruction buffer 38. 
Upon completing step 104 the control flow returns 

so to step 100 to await the beginning of the next 
cycle. 

Returning to step 110, if a branch has been 
detected in the sequential instruction buffer 38 step 
116 is executed in which a determination is made 
55 as to whether a target address can be generated. If 
not, the control flow proceeds to step 104. If so, the 
flow proceeds to step 126 to determine if all pre- 
vious instructions have been dispatched. If all pre- 
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vious instructions before the branch have not been 
dispatched, the flow continues to step 132 to deter- 
mine if prefetching of branch target instructions has 
begun. If not. in step 146 the target address is 
placed into the instruction fetch address register 30 
and prefetching is begun. At this time the existing 
contents of the instruction fetch address register 30 
is placed into the instruction fetch address register 
shadow 32. Upon completing step 146 or if 
prefetching has already begun the control flow pro- 
ceeds to step 150 to dispatch instruction from the 
sequential instruction buffer 38. Control flow then 
proceeds to step 102 where branch target instruc- 
tions are fetched into the target instruction buffer 
36. Afterwards the flow returns to step 100 to await 
the beginning of the next cycle. 

Returning to step 126, if ail previous instruc- 
tions before the branch have been dispatched, the 
flow proceeds to step 130 to determine if it is 
known at this point if the branch will be taken. If the 
branch is known to be taken the control flow pro- 
ceeds to step 136 to determine if prefetching has 
begun. If not the control flow proceeds to step 144 
to place the target address into the instruction fetch 
address register 30 and the current sequential buff- 
er is invalidated. As before the original contents of 
register 30 is placed into register 32. The flow then 
proceeds to step 104. Returning to step 136 if 
prefetching has already begun the flow proceeds to 
step 142 to load the instructions from the target 
instruction buffer 36 into the sequential instruction 
buffer 38 and to stop prefetching.. Again the flow 
proceeds to step 104. Returning to step 130 if it is 
not known whether or not the branch will be taken 
the flow proceeds to step 134 to determine if it is 
known whether or not the branch will not be taken. 
If the branch is not taken the flow continues to step 
140 to determine if prefetching has begun. If not, 
the flow continues to step 104. However, if 
prefetching has begun, the flow proceeds to step 
148 to stop the prefetching and restore the instruc- 
tion fetch address register 30 with the contents of 
the instruction fetch address shadow 32. Upon 
completing step 148 the control flow proceeds to 
step 104. Returning to step 134, if it is not known if 
the branch will not be taken, the control flow pro- 
ceeds to step 138 to load the unresolved branch 
register 54 and to set an internal bit indicating that 
there is an unresolved branch pending. Also, in 
sequential instructions are dispatched from the se- 
quential buffer 38. These sequential instructions 
are for conditional execution, i.e., that they will be 
executed only if the branch is not taken. The con- 
trol flow then proceeds to step 102. 

Returning to step 106, if there is an unresolved 
branch pending the flow proceeds to step 108 to 
determine if the branch is resolved, in determining 
if the branch has been resolved, the control logic 



examines the contents of the condition register 56 
to determine if the condition code has been set 
that is required for the resolution of the branch in 
the unresolved branch register 54. If the branch 

s has not been resolved the flow proceeds to step 
114 to determine if prefetching has begun. If not, 
the flow continues to step 122 to determine if the 
target address can be generated. If the target ad- 
dress cannot be generated the program fiow pro- 

io ceeds to step 104. If the target address can be 
generated the program proceeds to step 124 to 
store the contents of the instruction fetch address 
register 30 into the instruction fetch address regis- 
ter shadow 32 and to place the generated target 

is address into the instruction fetch address register 
30 and to begin prefetching of branch target 
instructions. Proceeding to step 128 the dispatch- 
ing of sequential instructions is begun. These se- 
quential instructions are being dispatched for con- 

20 ditional execution, i.e., for execution upon the con- 
dition that the branch will not be taken. After step 
128 the flow proceeds to step 102. 

Returning to step 108, if the branch has been 
resolved the flow continues to step 112 to deter- 

25 mine if the branch has been taken. If the branch 
has not been taken then in step 120 an execution 
conditional signal on line 41 (Figure 1) will be 
provided to the fixed point processing unit 20. The 
instruction fetch address register 30 will be re- 

30 stored with the contents of the instruction fetch 
address register shadow 32 and prefetching will be 
halted. The control flow then proceeds to step 104. 
If the branch is taken then in step 118 a cancel 
conditional sequential instruction execution signal is 

35 provided on line 43 to the fixed point processing 
unit 20 and the branch target instructions in the 
target instruction buffer 36 are loaded into the 
sequential instruction buffer 38. Also prefetching is 
halted. Control flow then proceeds to step 104. 

40 

Claims 

1, A data processing system comprising: 
45 means for storing a sequence of instructions; 

means for determining if said instruction sequence 
includes a branch instruction; 
means responsive to said determining means for 
storing a sequence of branch target instructions in 
so said storing means; 

means for executing said branch instruction; 
means for dispatching instructions in said se- 
quence to a processor for execution and dispatch- 
ing instructions in said sequence after said branch 
55 instruction to said processor to be executed on 
condition that a branch is not taken before a deter- 
mination of whether said branch will be taken while 
simultaneously determining if said branch is to be 
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taken; and 

means for directing, if said branch is not taken t 
said processor to execute said instructions in se- 
quence after said branch instruction or, if said 
branch is taken, for dispatching said branch target 
instruction sequence to said processor for execu- 
tion. 

2. A data processing system comprising: 
means for fetching a plurality of instructions in a 
sequence from a memory; 

a first, buffer means for storing a portion of said 
fetched instructions; 

means for determining if a branch instruction is 

stored in said first buffer means; 

means connected to said determining means for 

fetching a. sequence of branch target instructions in 

response to a branch instruction in said first buffer 

means; 

a second buffer means for storing a portion of said 
fetched branch target instructions; 
means for dispatching instructions from said first 
buffer means to a processor for execution by said 
processor on condition that a branch is taken; 
means for executing said branch instruction and 
determining if said branch is to be taken; 
means . for indicating to said processor that said 
instructions dispatched on condition are to be ex- 
ecuted if said branch is not taken; 
means for dispatching instructions from said sec- 
ond buffer means to said processor if said branch 
is taken; and 

means for . indicating to said processor that said 
conditional instructions are not to be executed if 
said branch is taken. 

3. A data processing system comprising: 
means for storing a sequence of instructions; 
means for determining if said instruction sequence 
includes a conditional branch instruction dependent 
upon a result from a previous instruction in said 
sequence; 

means responsive to said determining means for 
storing a sequence of branch target instructions in 
said storing means; 

means for executing said branch instruction; 
means for dispatching instructions in said se- 
quence to a processor for execution and dispatch- 
ing instructions in said sequence after said branch 
instruction to said processor to be executed on 
condition, that a branch is not taken before a deter- 
mination of whether said branch will be taken while 
simultaneously determining if said branch is to be 
taken after said result from said previous instruc- 
tion in said sequence is determined; and 
means for directing if said branch is not taken, said 
processor to execute said instructions in sequence 
after said branch instruction or, if said branch is 
taken, for dispatching said branch target instruction 
sequence to said processor for execution. 



4. A data processing system comprising: 
means for storing a sequence of instructions; 
means for determining if said instruction sequence 
includes a branch instruction; 

s means for executing said branch instruction; 

means responsive to said determining means for 
storing a sequence of branch target instructions in 
said storing means while simultaneously dispatch- 
ing instructions in sequence to a processor for 

w execution and including means for dispatching 
instructions in sequence after said branch instruc- 
tion to said processor to be executed on condition 
that a branch is not taken before a determination of 
whether said branch will be taken; 

is means for determining if said branch is to be taken; 
and 

means for directing if said branch is not taken, said 
processor to execute said instructions in sequence 
after said branch instruction or, if said branch is 
20 taken, for dispatching said branch target instruction 
sequence to said processor for execution. 

5. A data processing system comprising: 
means for storing (10) a plurality of instructions in 
sequence; 

25 means for determining whether said plurality of 
instructions includes a branch instruction; 
means responsive to said determining means for 
storing a plurality of branch target instructions in 
sequence, when said firstmentioned plurality of 

30 instructions includes a branch instruction, 

means for dispatching one or more instructions to a 
processor (20) for execution, 
means for dispatching one or more subsequent 
instructions after the branch instruction in the se- 

35 quence, for execution on condition that the branch 
is not to be taken, 

means for executing said branch instruction, there- 
by determining whether the branch is to be taken, 
means responsive to said means for executing, for 
40 directing said processor to execute said subse- 
quent instructions if said branch is not to be taken, 
and 

means responsive to said means for executing, for 
dispatching one or more of said branch target 
45 instructions for execution, if said branch is to be 
taken. 

6. A data processing system as claimed in 
Claim 5 in which 

said one or more subsequent instructions are dis- 
so patched and said branch instruction is executed 
simultaneously. 

7. A data processing system as claimed in 
Claim 5, further comprising: 

means for fetching (14) one or more instructions in 
55 sequence, 

a first buffer for storing said fetched instructions, 
means for fetching one or more branch target 
instructions, and 
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a second buffer for storing said branch target 
instructions, and in which 

said determining means includes means for deter- 
mining whether a branch instruction is stored in 
said first buffer. 

8. A data processing system as claimed in 
Claim 5 or Claim 6, in which: 

said branch instruction is a conditional branch in- 
struction dependent on the result of a previous 
instruction in said sequence, and 
said branch instruction is executed after the result 
of said previous instruction has been determined. 

9. A data processing system as claimed in 
Claim 5 in which said branch target instructions are 
stored and said subsequent instructions are dis- 
patched simultaneously. 

10. A data processing system as claimed in 
any preceding claim wherein said means for stor- 
ing a sequence of instructions includes means for 
fetching a plurality of instructions simultaneously. 

11. A data processing system as claimed in 
any preceding claim wherein said dispatching 
means includes means to dispatch a plurality of 
instructions simultaneously. 

12. A data processing system as claimed in 
Claim 10 wherein said fetching means includes 
means of storing both said address of a first in- 
struction of said instructions in sequence after said 
branch instruction and a first instruction of said 
branch target instruction sequence. 

13. A data processing system as claimed in 
any preceding claim wherein said means for ex- 
ecuting said branch instruction includes means for 
determining a result of a previous sequential in- 
struction executed in said processor. 

14. A data processing system as claimed in 
any preceding claim wherein said storing means 
includes means for separately storing said branch 
target instruction sequence from other instructions. 

15. A data processing system as claimed in 
Claim 3 wherein said means for executing said 
branch instruction includes means for setting a 
condition register with the result of said previous 
instruction execution in said sequence and said 
means for determining if said branch is to be taken 
includes means for reading said condition register. 

16. A data processing system as claimed in 
any preceding claim wherein said means for deter- 
mining if said branch is to be taken includes means 
for storing a branch instruction dependent upon an 
unresolved condition, in a register until the unresol- 
ved condition is resolved and said means for dis- 
patching instructions for conditional execution con- 
tinues to dispatch said conditional execution 
instructions until said condition is resolved. 

17. A data processing system as claimed in 
Claim 16 wherein said means for determining if 
said branch is to be taken includes means for 



determining when said condition is resolved by 
determining when a processor executing an in- 
struction on which the branch instructions depends, 
stores a condition code to the means for determin- 
s ~ ing if said branch is to be taken. 

18. A pipelined data processing system com- 
prising: 

means for storing a sequence of instructions 
means for fetching a sequence of instructions dur- 

w ing a cycle; 

means for determining if said instruction sequence 
includes a branch instruction and in response to a 
determination of a branch instruction, storing a 
sequence of branch target instructions in said stor- 

75 ing means; 

means for dispatching a plurality of said fetched 
instructions in a cycle to a processor for execution 
and dispatching instructions in said sequence after 
said branch instruction to said processor to be 

20 executed on condition that a branch is not taken 
before a determination of whether said branch will 
be taken while simultaneously determining if said 
branch is to be taken after the result from a pre- 
vious instruction in said sequence is determined; 

25 means for decoding said dispatched instructions in 
a cycle; and 

means for executing said decoded instructions in a 
cycle and including means for directing, if said 
branch is not taken, said processor to execute said 
30 instructions in sequence after said branch instruc- 
tion or, if said branch is taken, for dispatching said 
branch target instruction sequence to said proces- 
sor for execution. 

19. A method of operation of a data processing 
as system comprising the steps of: 

storing a sequence of instructions; 

determining if said instruction sequence includes a 

branch instruction; 

if said sequence of instructions includes a branch 
40 instruction, storing a sequence of branch target 
instructions; 

dispatching instructions in said sequence to a pro- 
cessor for execution; 

dispatching instructions in said sequence after said 
45 branch instruction to said processor to be executed 
on condition that a branch is not taken before a 
determination of whether said branch will be taken 
while simultaneously determining if said branch is 
to be taken; and 
so directing, if said branch is not taken, said processor 
to execute said instructions in sequence after said 
branch instruction or, if said branch is taken, for 
dispatching said branch target instruction, sequence 
to said processor for execution. 
55 20. A method of operation, of a data processing 
system, comprising the steps of: 
fetching a plurality , of instructions in a sequence 
from a memory; 
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storing a portion of said fetched instructions; 
determining if a branch instruction is stored; 
fetching a sequence of branch target instructions in 
response to a branch instruction in said instruction 
sequence; 

storing a portion of said fetched branch target 
instructions; 

dispatching said first stored instructions to a pro- 
cessor for execution by a processor on condition 
that a branch is taken; 

executing said branch instruction and determining 
if said branch is to be taken; 
indicating to said processor that said instructions 
dispatched on condition are to be executed if said 
branch is not taken; 

dispatching said branch target instructions to said 
processor if said branch is taken; and 
indicating to said processor that said conditional 
instructions are not to be executed if said branch is 
taken. 

21 , A method of operation of a data processing 
system, comprising the steps of: 

storing a sequence of instructions; 

determining, if said instruction sequence includes a 

conditional branch instruction dependent upon a 

result from a previous instruction in said sequence; 

storing a sequence of branch target instructions if a 

conditional branch instruction is in said instruction 

sequence; 

dispatching instructions in said sequence to a pro- 
cessor for execution and dispatching instructions in 
said sequence after said branch instruction to said 
processor to be executed on condition that said 
branch is not taken before a determination of 
whether said branch will be taken while simulta- 
neously determining if said branch is to be taken 
after said result from said previous instruction in 
said sequence is determined; and 
directing, if said branch is not taken, said processor 
to execute said instructions in sequence after said 
branch instruction or, if said branch is taken, dis- 
patching said branch target instruction sequence to 
said processor for execution. 

22. A method of operation of a data processing 
system, comprising the steps of: 

storing a sequence of instructions; 

determining if said instruction sequence includes a 

branch instruction; 

storing a sequence of branch target instructions if a 
branch instruction is in said instruction sequence 
while simultaneously dispatching instructions in se- 
quence to a processor for execution and dispatch- 
ing instructions in sequence after said branch in- 
struction to said processor to be executed on con- 
dition that a branch is not taken before a deter- 
mination of whether said branch will be taken; 
determining if said branch is to be taken; and 
directing, if said branch is not taken, said processor 



to execute said instructions in sequence after said 
branch instruction or, if said branch is taken, dis- 
patching said branch target instruction sequence to 
said processor for execution. 

5 23. A method of operation of a pipelined data 

processing system, comprising the steps of: 
fetching a sequence of instructions during a cycle; 
determining if said instruction sequence includes a 
branch instruction and in response to a determina- 

io tion of a branch instruction, storing a sequence of 
branch target instructions; 
dispatching a plurality of said fetched instructions 
in a cycle to a processor for execution and dis- 
patching instructions in said sequence after said 

/5 branch instruction to said processor to be executed 
on condition that a branch is not taken before a 
determination of whether said branch will be taken 
while simultaneously determining if said branch is 
to be taken after the result from a previous instruc- 

20 tion in said sequence is determined; 

decoding said dispatched instructions in a cycle; 
and 

executing said decoded instructions in a cycle and 
including a step of directing, if said branch is not 
25 taken, said processor to execute said instructions 
in sequence after said branch instruction or, if said 
branch is taken, dispatching said branch target 
instruction sequence to said processor for execu- 
tion. 

so 
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